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Abstract 

The graph removal lemma states that any graph on n vertices with o{n^^^'^) copies of a fixed 
graph H may be made H-bee by removing o(n^) edges. Despite its innocent appearance, this lemma 
and its extensions have several important consequences in number theory, discrete geometry, graph 
theory and computer science. In this survey we discuss these lemmas, focusing in particular on 
recent improvements to their quantitative aspects. 

1 Introduction 

The triangle removal lemma states that for all e > there exists 6 > such that any graph on n 
vertices with at most 6n^ triangles may be made triangle-free by removing at most en^ edges. This 
result, proved by Ruzsa and Szemeredi ^93j in 1976, was originally stated in rather different language. 

The original formulation was in terms of the (6, 3)-problem0 This asks for the maximum number of 
edges /'■'^^(n, 6, 3) in a 3-uniform hypergraph on n vertices such that no 6 vertices contain 3 edges. 
Answering a question of Brown, Erdos and Sos [19], Ruzsa and Szemeredi showed that f^^\n,6,3) = 
o(n^). Their proof used several iterations of an early version of Szemeredi's regularity lemma |110j . 

This result, developed by Szemeredi in his proof of the Erdos- Turan conjecture on arithmetic progres- 
sions in dense sets [109j, states that every graph may be partitioned into a small number of vertex 
sets so that the graph between almost every pair of vertex sets is random-like. Though this result now 
occupies a central position in graph theory, its importance only emerged over time. The resolution of 
the (6, 3)-problem was one of the first indications of its strength. 

The Ruzsa-Szemeredi theorem was generalized by Erdos, Frankl and Rodl [32j, who showed that 
f^^\n,3r — 3,3) = o(n^), where f^'^\n,3r — 3,3) is the maximum number of edges in an r-uniform 
hypergraph such that no 3r — 3 vertices contain 3 edges. One of the tools used by Erdos, Frankl and 
Rodl in their proof was a striking result stating that if a graph on n vertices contains no copy of a 
graph H then it may be made K^-free, where r = x{H) is the chromatic number of H, by removing 
o(n^) edges. The proof of this result used the modern formulation of Szemeredi's regularity lemma 
and is already very close, both in proof and statement, to the following generalization of the triangle 
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removal lemma, known as the graph removal lemmao This was first stated explicitly in the literature 
by Alon, Duke, Lefmann, Rodl and Yuster [4J and by Fiiredi [47J in 19941 Note that we use v{H) to 
denote the number of vertices in a graph (or hypergraph) H. 

Theorem 1.1 For any graph H and any e > 0, there exists 6 > such that any graph on n vertices 
which contains at most 5n"^^'> copies of H may be made H-free by removing at most ev? edges. 

It was already observed by Ruzsa and Szemeredi that the (6, 3)-problem (and, thereby, the triangle 
removal lemma) is related to Roth's theorem on arithmetic progressions [21] ■ This theorem states that 
for any 5 > there exists an uq such that if n > no then any subset of the set [n] := {1, 2, . . . , re} of 
size at least 5n contains an arithmetic progression of length 3. Letting r^[n) be the largest integer such 
that there exists a subset of the set {1,2, . . . ,re} of size r^{n) containing no arithmetic progression 
of length 3, this is equivalent to saying that r^[n) = o{n). Ruzsa and Szemeredi observed that 
/(^)(re, 6,3) = Q.(r^{n)n). In particular, since /'^^-'(re, 6, 3) = o(re^), this implies that r^(n) = o{n), 
yielding a proof of Roth's theorem. 

It was further noted by Solymosi [104j that the Ruzsa-Szemeredi theorem yields a stronger result of 
Ajtai and Szemeredi [T]. This result states that for any 6 > there exists an no such that if re > reo then 
any subset of the set [n] x [re] of size at least 6n'^ contains a set of the form {(a, 6), {a + d,b), {a,b + d)}. 
That is, dense subsets of the 2-dimensional grid contain axis-parallel isosceles triangles. Roth's theorem 
is a simple corollary of this statement. 

Roth's theorem is the first case of a famous result known as Szemeredi's theorem. This result, to which 
we alluded earlier, states that for any natural number A; > 3 and any 5 > there exists reo such that 
if re > reo then any subset of the set [re] of size at least 5n contains an arithmetic progression of length 
k. This was first proved by Szemeredi ^09j in the early seventies using combinatorial techniques and 
since then several further proofs have emerged. The most important of these are that by Furstenberg 
[4:8\ [50] using ergodic theory and that by Gowers [531 154] . who found a way to extend Roth's original 
Fourier analytic argument to general k. Both of these methods have been highly influential. 

Yet another proof technique was suggested by Frankl and Rodl [12]. They showed that Szemeredi's 
theorem would follow from the following generalization of Theorem I l.l) referred to as the hypergraph 
removal lemma. They proved this theorem for the specific case of , the complete 3- uniform 
hypergraph with 4 vertices. This was then extended to all 3- uniform hypergraphs in |77] and to K^"^^ 
in [89]. Finally, it was proved for all hypergraphs by Gowers [55] [56] and, independently, by Nagle, 
Rodl, Schacht and Skokan j78[ I88j. Both proofs rely on extending Szemeredi's regularity lemma to 
hypergraphs in an appropriate fashion. 

Theorem 1.2 For any k-uniform hypergraph % and any e > 0, there exists (5 > such that any 
k-uniform hypergraph on re vertices which contains at most 6n'"^^^ copies ofH may be made H-free by 
removing at most en'' edges. 

■^The phrase 'removal lemma' is a comparatively recent coinage. It seems to have come into vogue in about 2005 when 
the hypergraph removal lemma was first proved (see, for example, [671 1781 [1061 1112j \ 

^This was also the first time that the triangle removal lemma was stated explicitly, though the weaker version 
concerning graphs where every edge is contained in exactly one triangle had already appeared in the literature. The 
Ruzsa-Szemeredi theorem was usually [401 1411 146| phrased in the following suggestive form: if a 3-uniform hypergraph 
is linear, that is, no two edges intersect on more than a single vertex, and triangle- free, then it has o(n^) edges. A more 
explicit formulation may be found in [23] . 
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As well as reproving Szemeredi's theorem, the hypergraph removal lemma allows one to reprove the 
multidimensional Szemeredi theorem. This theorem, originally proved by Furstenberg and Katznelson 
|49j . states that for any natural number r, any finite subset S of Z'' and any 6 > there exists no 
such that if n > no then any subset of [nY' of size at least Sn^' contains a subset of the form a ■ S + d, 
that is, a dilated and translated copy of S. That it follows from the hypergraph removal lemma 
was first observed by Solymosi [105| . This was the first non-ergodic proof of this theorem. A new 
proof of the special case S = {(0, 0), (1, 0), (0, 1)}, corresponding to the Ajtai-Szemeredi theorem, was 
given by Shkredov }102j using a Fourier analytic argument. Recently, a combinatorial proof of the 
density Hales-Jewett theorem, which is an extension of the multidimensional Szemeredi theorem, was 
discovered as part of the polymath project [8T] . 

As well as its implications in number theory, the removal lemma and its extensions are central to 
the area of computer science known as property testing. In this area, one would like to find fast 
algorithms to distinguish between objects which satisfy a certain property and objects which are far 
from satisfying that property. This field of study was initiated by Rubinfield and Sudan [92] and, 
subsequently, Goldreich, Goldwasser and Ron [51] started the investigation of such property testers 
for combinatorial objects. Graph property testing has attracted a particular degree of interest. 

A classic example of property testing is to decide whether a given graph G is e-far from being triangle- 
free, that is, whether at least en^ edges will have to removed in order to make it triangle-free. The 
triangle removal lemma tells us that if G is e-far from being triangle free then it must contain at least 
dn^ triangles for some 6 > depending only on e. This furnishes a simple probabilistic algorithm for 
deciding whether G is e-far from being triangle- free. We choose t = 26~^ triples of points from the 
vertices of G uniformly at random. If G is e-far from being triangle-free then the probability that none 
of these randomly chosen triples is a triangle is (1 — 5)* < e~^^ < |. That is, if G is e-far from being 
triangle-free we will find a triangle with probability at least |, whereas if G is triangle- free we will 
clearly find no triangles. The graph removal lemma may be used to derive a similar test for deciding 
whether G is e-far from being H-iree for any fixed graph H. 

In property testing, it is often of interest to decide not only whether a graph is far from being H-hee 
but also whether it is far from being induced H-fiee. A subgraph H' of a graph G is said to be an 
induced copy of H if there is a one-to-one map / : V{H) — V{H') such that (/(u),/(u)) is an edge 
of H' if and only if (u, v) is an edge of H. A graph G is said to be induced H-fiee if it contains no 
induced copies of H and e-far from being induced H-free if we have to add and/or delete at least en? 
edges to make it induced H-fiee. Note that it is not enough to delete edges since, for example, if H 
is the empty graph on two vertices and G is the complete graph minus an edge, then G contains only 
one induced copy of H, but one cannot simply delete edges from G to make it induced if-free. 
By proving an appropriate strengthening of the regularity lemma, Alon, Fischer, Krivelevich and 
Szegedy [6] showed how to modify the graph removal lemma to this setting. This result, which allows 
one to test for induced i7-freeness, is known as the induced removal lemma. 

Theorem 1.3 For any graph H and any e > 0, there exists a 6 > such that any graph on n vertices 
which contains at most Sn"^^^ induced copies of H may he made induced H-free by adding and/or 
deleting at most ev? edges. 

A substantial generalization of this result, known as the infinite removal lemma, was proved by Alon 
and Shapira |12j (see also [75]). They showed that for each (possibly infinite) family 'H, of graphs and 
e > there is 5 = (5->^(e) > such that if a graph G on n vertices contains at most bn^^^^ induced 
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copies of H for every graph H inT-L, then G may be made induced H-hee, for every H & T-L, hy adding 
and/or deleting at most en^ edges. They then used this result to show that every hereditary graph 
property is testable, where a graph property is hereditary if it is closed under removal of vertices. These 
results were extended to 3- uniform hypergraphs by Avart, Rodl and Schacht [U] and to fc-uniform 
hypergraphs by Rodl and Schacht [86] . 

In this survey we will focus on recent developments, particularly with regard to the quantitative 
aspects of the removal lemma. In particular, we will discuss recent improvements on the bounds for 
the graph removal lemma. Theorem II. H and the induced graph removal lemma, Theorem 11.31 each of 
which bypasses a natural impediment. 

The usual proof of the graph removal lemma makes use of the regularity lemma and gives bounds 
for the removal lemma which are of tower-type in e. To be more specific, let r(l) = 2 and, for each 
i > 1, T{i + 1) = 2^(*). The bounds that come out of applying the regularity lemma to removal then 
say that if (5~^ = T{e~'^") then any graph with at most 6n'"^^^ copies of H may be made H-iiee by 
removing at most en^ edges. Moreover, this tower-type dependency is inherent in any proof employing 
regularity. This follows from an important result of Gowers [52] (see also [24J) which states that the 
bounds that arise in the regularity lemma are necessarily of tower type. We will discuss this in more 
detail in Section [2T] below. 

Despite this obstacle, the following improvement was made by Fox [38] . 

Theorem 1.4 For any graph H, there exists a constant ajj such that if S^^ = T(a// log e^^) then any 
graph on n vertices which contains at most 6n'"^^^ copies of H may he made H-free by removing at 
most en^ edges. 

As is implicit in the bounds, the proof of this theorem does not make an explicit appeal to Szemeredi's 
regularity lemma. However, many of the ideas used are similar to ideas used in the proof of the 
regularity lemma. The chief difference lies in the fact that the conditions of the removal lemma 
(containing few copies of a given graph H) allow us to say more about the structure of these partitions. 
A simplified proof of this theorem will be the main topic of Section 12.21 

Though still of tower-type, Theorem 11.41 improves substantially on the previous bound. However, it 
remains very far from the best known lower bound on . The observation of Ruzsa and Szemeredi 
[93] that /(3)(n,6,3) = VL {r^{n)n) allows one to transfer lower bounds for r3(n) to a corresponding 
lower bound for the triangle removal lemma. The best construction of a set containing no arithmetic 
progression of length 3 is due to Behrend [16] and gives a subset of [n] with density g-cViogn^ Trans- 
ferring this to the graph setting yields a graph containing e'^^°§'^ triangles which cannot be made 
triangle-free by removing fewer than en^ edges. This quasi-polynomial lower bound, 5^^ > e^'^'°s^ , 
remains the best knownj^ 

The standard proof of the induced removal lemma uses the strong regularity lemma of Alon, Fischer, 
Krivelevich and Szegedy [6] . We will speak at length about this result in Section 13.11 Here it will 
suffice to say that, like the ordinary regularity lemma, the bounds which an application of this theorem 

''it is worth noting that the best known upper bound for Roth's theorem, due to Sanders [55], is considerably better 
than the best upper bound for r3(n) that follows from triangle removal. This upper bound is rz{n) — O ^ ■ 

A recent result of Schoen and Shkredov [99|, building on further work of Sanders shows that any subset of [n] of 

^1 log" -il/S 

density e ^logiogn' contains a solution to the equation xi + ■ ■ ■ + xs = 5xq. Since arithmetic progressions correspond 
to solutions of xi + X2 = 2x3, this suggests that the answer should be closer to the Behrend bound. The bounds for 
triangle removal are unlikely to impinge on these upper bounds for some time, if at all. 
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gives for the induced removal lemma are necessarily very large. Let W^(l) = 2 and, for i > 1, 
W{i + 1) = T(W{i)). This is known as the wowzer function and its values dwarf those of the usual 
tower function^ By using the strong regularity lemma, the standard proof shows that we may take 
= W{aH^~'^) in the induced removal lemma, Theorem II .31 Moreover, as with the ordinary removal 
lemma, such a bound is inherent in the application of the strong regularity lemma. This follows from 
recent results of Conlon and Fox p4] and, independently, Kalyanasundaram and Shapira [61j showing 
that the bounds arising in strong regularity are necessarily of wowzer type. 

In the other direction, Conlon and Fox [23] showed how to bypass this obstacle and prove that the 
bounds for are at worst a tower in a power of e~^. 

Theorem 1.5 There exists a constant c > such that, for any graph H, there exists a constant an 
such that if 6^^ = T{aHe^^) then any graph on n vertices which contains at most Sn"^^^ induced 
copies of H may be made induced H-free by adding and/or deleting at most en? edges. 

A discussion of this theorem will form the subject of Section [3.21 The key observation here is that the 
strong regularity lemma is used to prove an intermediate statement (Lemma 13.21 below) which then 
implies the induced removal lemma. This intermediate statement may be proved without recourse to 
the full strength of the strong regularity lemma. There are also some strong parallels with the proof 
of Theorem 11.41 which we will draw attention to in due course. 

In Section \2>.2>\ we present the proof of Alon and Shapira's infinite removal lemma. In another paper, 
Alon and Shapira [11] showed that the dependence in the infinite removal lemma can depend heavily 
on the family %. They proved that for every function 5 : (0, 1) — )• (0, 1), there exists a family T-L of 
graphs such that any 5-^ : (0, 1) — t- (0, 1) which satisfies the infinite removal lemma for Ti satisfies 
6-}{ = o{S). However, such examples are rather unusual and the proof presented in Section [3.31 of the 
infinite removal lemma implies that for many commonly studied families Ti of graphs the bound on 

is only tower-type, improving the wowzer-type bound from the original proof. 
Our discussions of the graph removal lemma and the induced removal lemma will occupy the bulk 
of this survey but we will also talk about some further recent developments in the study of removal 
lemmas. These include arithmetic removal lemmas (Section |3| and the recently developed sparse 
removal lemmas which hold for subgraphs of sparse random and pseudorandom graphs (Section [5]). 
We will conclude with some further comments on related topics. 

2 The graph removal lemma 

In this section we will discuss the two proofs of the removal lemma. Theorem 11.11 at length. In 
Section 12.11 we will talk about the regularity lemma and the usual proof of the removal proof. Then, 
in Section [2.21 we will consider a simplified variant of the second author's recent proof [38], showing 
how it connects to the weak regularity lemma of Frieze and Kannan |44l |35] . 

2.1 The standard proof 

We begin with the proof of the regularity lemma and then deduce the removal lemma. For vertex 
subsets S*, r of a graph G, we let edS, T) denote the number of pairs in x T that are edges of G and 

^To give some indication, we note that M^(2) = 4, W^(3) = 65536 and VK(4) is a tower of 2s of height 65536. 
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dciS, T) = clsnote the fraction of pairs in S" x T that are edges of G. For simphcity of notation, 

we drop the subscript if the graph G is clear from context. A pair (5, T) of subsets is e-regular if, 
for all subsets S' C S and T' C T with \S'\ > e\S\ and \T'\ > e\T\, we have \d{S',T') - d{S,T)\ < e. 
Informally, a pair of subsets is e-regular with a small e if the edges between S and T are uniformly 
distributed among large subsets. 

Let G = (y, E) be a graph and P : y = Vi U . . . U be a vertex partition of G. The partition of P is 
equitable if each pair of parts differ in size by at most 1. The partition P is e-regular if all but at most 
ek"^ pairs of parts iVi^Vj) are e-regular. We next state Szemeredi's regularity lemma. 

Lemma 2.1 For every e > 0, there is K = K[e) such that every graph G = {V^E) has an equitable, 
e-regular vertex partition into at most K parts. Moreover, we may take K to be a tower of height 
Die-'). 

Let q : [0,1] — )• M be a convex function. For vertex subsets S,T C V a graph G, let q{S, T) = 
q{d{S,T))\-^. For partitions 5 : 5 = Si U . . . U 5a and T : T = Ti U . . . U T;,, let q{S,T) = 
X^i<j<a i<i<b Qi'^i^'^j)- Fo'^ a vertex partition P : V = ViU . . .UVk oi G, define the mean-g density to 

be ~ ~ ' ~ ~ 

q{P)=q{P,P)= J2 QiVi,Vj). 

l<i,j<k 

We next state some simple properties which follow from Jensen's inequality using the convexity of q. 
A refinement of a partition P of a vertex set V is another partition Q oiV such that every part of Q 
is a subset of a part of P. 

Proposition 2.1 1. For partitions S and T of vertex subsets S and T , we have q{S, T) > q{S, T). 

2. If Q is a refinement of P, then q{Q) > q{P)- 

3. If d = d{G) = d(y, V) is the edge density of G, then, for any vertex partition P, 

q{d) < q{P) < dq{l) + (1 - d)q{Q). 

The first and second part of Proposition 12.11 show that by refining a vertex partition the mean-q density 
cannot decrease, while the last part gives the range of possible values for q{P) if we only know the 
edge density d oi G. 

The convex function q{x) = x"^ for x G [0, 1] is chosen in the standard proof of the graph regularity 
lemma and we will do the same for the rest of this subsection. The following lemma is the key claim 
for the proof of the regularity lemma. The set-up is that we have a partition P which is not e-regular. 
For each pair iVi^Vj) of parts of P which is not e-regular, there are a pair of witness subsets Vij,Vji 
to the fact that the pair of parts is not e-regular. We consider the coarsest refinement Q of P so that 
each witness subset is the union of parts of Q. The lemma concludes that the number of parts of Q is 
at most exponential in the number of parts of P and, using a Cauchy-Schwarz defect inequality, that 
the mean-g density of the partition Q is substantially larger than the mean-q density of P. Because 
it simplifies our calculations a little, we will assume, when we say a partition is equitable, that it is 
exactly equitable, that is, that all parts have precisely the same size. This does not affect our results 
substantially but simplifies the presentation. 
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Lemma 2.2 If an equitable partition P : V = ViL) . . .L)Vk is not e-regular then there is a refinement 
Q of P into at most k2^ parts for which q{Q) > q{P) + e^- 

Proof: For each pair {Vi,Vj) which is not e-regular, there are subsets Vij C Vi and Vji C Vj with 
{Vijl > and \Vji\ > e|V^| such that \d{Vij,Vji) — d{Vi,Vj)\ > e. For each part Vj such that {Vi,Vj) 
is not e-regular, we have a partiton Pij of Vi into two parts Vij and Vi \ Vij. Let Pi be the partition 
of Vi which is the common refinement of these at most k — 1 partitions of Vi, so Pi has at most 2^~^ 
parts. We let Q be the partition of V which is the union of the k partitions of the form Pi, so Q has 
at most k2^~^ parts. We have 

q{Q)-qiP) = Y.^q{n,Pj)-q{Vi,V,)) 

i,j 

> Yl {q{P^,P,)-q{V,V,)) 

(Vi,Vj) irregular 

> {q{P^J,Pj^)-q{V„V,)) 

{Vi,Vj) irregular 

E E ^^id{u,w)-div,v,)f 

(y,,V,) irregular (7GPij,W/GPji ' ' 

> E ^-^W^id{V^^V,.)-d{V.,V,)f 

{Vi,Vj) irregular 

where the first and third inequalities are by noting that the summands are nonnegative and the second 
inequality follows from the first part of Proposition 12.11 which shows that the mean-g density cannot 
decrease when taking a refinement. In the fourth inequality, we used that \Vij\ > e|I^| > ^\V\ and 
similarly for | Vji \ . Finally, the equality in the fourth line follows from the identity 

Y \U\\W\d{V„V,) = Y \U\\W\d{U,W), 

which counts e{Vi,Vj) in two different ways. This completes the proof. □ 

The next lemma, which is rather standard, shows that for any vertex partition Q, there is a vertex 
equipartition P' with a similar number of parts to Q and mean-square density not much smaller than 
the mean-square density of Q. It is useful in density increment arguments where at each stage one 
would like to work with an equipartition. It is proved by first arbitrarily partitioning each part of Q 
into parts of order except possibly one additional remaining smaller part, and then arbitrarily 

partitioning the union of the smaller remaining parts into parts of order 

Lemma 2.3 Let G = {V, E) he a graph and Q : V = ViiJ . . .UV^ he a vertex partition into k parts. 
Then, for q(x) = , there is an equitable partition P' ofV into t parts such that q{P') > q{Q) — 2|. 

Combining Lemmas 12.21 and 12.31 with t = 4e~^|(5| < e~^/c2'^+^, we obtain the following corollary. 
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Corollary 2.1 If an equitable partition P : V = ViU . . .UVk is not e-regular then there is an equitable 
refinement P' of P into at most e^^k2^^'^ parts for which q{P') > q{P) + 

We next show how Szemeredi's regularity lemma, Lemma [2. 11 can be quickly deduced from this result. 

Proof: To prove the regularity lemma, we start with the trivial partition Pq into one part, and 
iterate the above corollary to obtain a sequence Pq, Pi, . . . , Ps of equitable partitions with ^(Pj+i) > 
g(Pi) + eV2 until we arrive at an equitable e-regular partition Pg. As the mean-square density of each 
partition has to lie between and 1, after at most 2e~^ iterations we arrive at the equitable e-regular 
partition Pg with s < 2e~^. The number of parts increases by one exponential in each iteration, 
giving the desired number of parts in the regularity partition. This completes the proof of Szemeredi's 
regularity lemma. □ 

The constructions of Gowers [52j and the authors ^24] show that the tower-type bound on the number 
of parts in Szemeredi's regularity lemma is indeed necessary. In particular, the construction in |24] 
shows that K{e) in Lemma l2.ll is at least a tower of twos of height r2(e^^). The constructions are 
formed by reverse engineering the upper bound proof. We construct a sequence Pq, . . . , Ps of partitions 
with s = r2(e~^). As in the upper bound proof, each partition in the sequence uses exponentially more 
parts than the previous partition in the sequence. We may choose the edges using these partitions 
and some randomness so as to guarantee that none of these partitions (except the last) are e-regular. 
Furthermore, we can guarantee that any partition that is e-regular must be close to being a refinement 
of the last partition in this sequence. This implies that the number of parts must be at least roughly 

We next prove the graph removal lemma, Theorem ll.il from the regularity lemma. 

Proof: Let m denote the number of edges of if, so m < (2). Let 7 = and 5 = (2/i)^^^e™ir^'*, 
where K = K{'^) is as in the regularity lemma. We apply the regularity lemma to G and obtain an 
equitable, 7-regular partition into k < K parts. If the number n of vertices of G satisfies n < 5~^/^, 
then the number of copies of if in G is at most 5n^ < 1 and G is ii-free, in which case there is 
nothing to prove. So we may assume n > 5~^/^ . We obtain a subgraph G' of G by removing edges of 
G between all pairs of parts which are not 7-regular or which have edge density at most e. As there 
are at most ^k'^ ordered pairs of parts which are not 7-regular and each part has order at most 2n/k, 
at most {''fk'^ /2){2n/k)'^ = 2'yn^ edges are deleted between pairs of parts which are not 7-regular. The 
number of edges between parts which have edge density at most e is at most en^/2. Hence, the number 
of edges of G deleted to obtain G' is at most 2771^ + en^/2 < en^. If G' is ii-free, then we are done. 

Assume for contradiction that G' is not ii-free. A copy of H in G' must have its edges going between 
pairs of parts which are both 7-regular and have density at least e. Hence, there is a mapping from 
V{H) to the partition of V{G) so that each edge of H maps to a pair of parts which is both 7-regular 
and have edge density at least e. But the following standard counting lemma (see, e.g., Lemma 3.2 
in Alon, Fischer, Krivelevich and Szegedy |6] for a minor variant) shows that the number of labeled 
copies of H in G' (and hence in G) is at least 2~^e"^{n/2k)^ > h\5nP'. This contradicts that G has at 
most 5n^ copies of ii, completing the proof. □ 

Lemma 2.4 If H is a graph with vertices 1, . . . ,h and m edges and G is a graph with not necessarily 
disjoint vertex subsets Wi, . . . ,Wh such that \Wi\ > 7"^ for 1 < i < h and, for every edge of 
H , the pair {Wi,Wj) is ^-regular with density d{Wi,Wj) > e and 7 < then G contains at least 
2~^e"^\Wi\ X • • • X \Wh\ labeled copies of H with the copy of vertex i in Wi. 
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The standard proof of this counting lemma uses a greedy embedding strategy. One considers embed- 
ding the vertices one at a time, using the regularity condition to maintain the property that at each 
step where vertex i of H is not yet embedded the set of vertices of G which could potentially be used 
to embed vertex i is large. 



2.2 An improved bound 

A partition P : V = ViL) . . .L)Vk of the vertex set of a graph G = {V, E) is weak e-regular if, for all 
subsets S,T C V, we have 



e(5,T)- J2 \SnVi\\TnVj\d{v,,Vj: 



l<i,j<k 



<e\V\' 



That is, the density between two sets may be approximated by taking a weighted average over the 
densities between the sets which they intersect. 

The Frieze-Kannan weak regularity lemma |44^ H5] states that any graph has such a weak regular 
partition. 

Lemma 2.5 Let R{e) = 2^'^ , where c is an absolute constant. For every graph G = (V, E) and every 
equitable partition P of G into k parts, there is an equitable partition P' which is a refinement of P 
into at most kR{e) parts which is weak e-regular. 

Unlike the usual regularity lemma, the bounds in Lemma 12.51 are quite reasonablelfl It is therefore 
natural to try to apply it to prove the removal lemma. However, it seems unlikely that this lemma is 
itself sufficient to prove the removal lemma, since it only gives control over edge densities of a global 
nature. 

However, as noted by Tao j lllj (see also |87J), one can prove a stronger theorem by simply iterating 
the Frieze-Kannan weak regularity lemmaUl Tao developed this lemma to give an alternative proof of 
the regularity lemmslf] which extended more easily to hypergraphs [112] . Here we use it to improve 
the bounds for removal. 

Lemma 2.6 Let q : [0,1] ^ M be a convex function, G be a graph with d = d{G), / : N — )■ [0, 1] be a 
decreasing function and r = {dq{l) -|- (1 — d)q{0) — q{d)) /j. Then there are equitable partitions P and 
Q with Q a refinement of P satisfying q{Q) < q{P) +j,Qis weak f{\P\) -regular and \Q\ < t^, where 
to = 1, ti = tj„ii?(/(ti_i)) for 1 < i < r and R(x) = 2'^^ as in the Frieze-Kannan weak regularity 
lemma. 

The proof of Lemma 12.61 is quite similar to the proof of Szemeredi's regularity lemma discussed in the 
previous subsection. One starts with the trivial partition Pq of V into one part. We then apply Lemma 



®They are also sharp, that is, there are graphs for which the minimum mimber of parts in any weak e-regular partition 
is 2"*' '\ This was proved in [25 (see also [5]). 

'^We will say more about this sort of iteration in Section 13.11 below. 

*More recently, Conlon and Fox [23] showed that it is also closely related to the regular approximation lemma. 
This lemma, which arose in the study of graph limits by Lovasz and Szegedy [TJ and also in work on the hypergraph 
generalization of the regularity lemma by Rodl and Schacht ^5], says that by adding and/or deleting a small number of 
edges in a graph G, we may find another graph G' which admits very fine regular partitions. We refer the reader to [21] 
and [87] for further details. 
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12.51 repeatedly to construct a sequence of partitions Pq,Pi, . . . so that Pj+i is weak /(|Pj|)-regular. If 
> q{Pi) + 7, then we continue with this process. Otherwise, q{Pi+i) < q{Pi) + 7, so we set 
Q = Pi+i and P = Pi and stop the process. This process must stop within r iterations as the third 
part of Proposition 12.11 shows that the mean-g density Hes in an interval of length r^. 

Rather than using the usual q{x) = x^, we will use the convex function q on [0, 1] defined by q(0) = 
and q{x) = xlogx for x S (0,1]. This entropy function is central to the proof since it captures the 
extra structural information coming from Lemma 12.81 below in a concise fashion. Note that the last 
part of Proposition 12.11 implies that dlogd < q{P) < for every partition P. 

The next lemma is a counting lemma that complements the Frieze-Kannan weak regularity lemma. 
As one might expect, this lemma gives a global count for the number of copies of H, whereas the 
counting lemma associated with the usual regularity lemma gives a means of counting copies of H 
between any v{H) parts of the partition which are pairwise regular. Its proof, which we omit, is by a 
simple telescoping sum argument. 

Lemma 2.7 fllS^ . Theorem 2.7 on page 1809) Let H he a graph on {l,...,/i} with m edges. Let 
G = {V, E) he a graph on n vertices and Q : V = ViU . . .UVt he a vertex partition which is weak 
e-regular. The numher of homomorphisms from H to G is within emn^ of 

h 

l<ii,...,ih<t {r,s)GE{H) a=l 

Let P and Q be vertex partitions of a graph G with Q a refinement of P. A pair (Vi, Vj) of parts of 
P is (a, c) -shattered by Q if at least a c-fraction of the pairs {u, v) £ Vi x Vj go between pairs of parts 
of Q with edge density between them less than a. 

One of the key components of the proof is the following lemma, which says that if P and Q are vertex 
partitions like those given by Lemma 12.61 then there are many pairs of vertex sets in P which are 
shattered by Q. 

Lemma 2.8 Let H be a graph on {1, . . . ,h} with m edges and let a > 0. Suppose G is a graph on 
n vertices for which there are less than 6n^ homomorphisms of H into G, where 6 = ja'^{2k)~^ . 
Suppose P and Q are equitahle vertex partitions of G with \P\ = k < n and Q is a refinement of P 
which is weak f{k)-regular, where f{k) = -^a^(2k)^^. For every h-tuple Vi,...,V/i of parts of P, 
there is an edge {i,j) of H for which the pair (Vi, Vj) is (a, 1^)- shattered hy Q. 



Proof: As |P| = < n, we have > ^ for each i. Let Qi denote the partition of Vi which consists 
of the parts of Q which are subsets of Vi. Consider an /i-tuple (vi, . . . ,Vh) G Vi x ■ ■ ■ x Vh picked 
uniformly at random. Also consider the event E that, for each edge {i,j) of H, the pair {vi,Vj) goes 
between parts of Qi and Qj with density at least a. If E occurs with probability at least 1/2, as Q 
is weak /(/c)-regular. Lemma 12.71 implies that the number of homomorphisms of H into G where the 
copy of vertex i is in for 1 < i < h is at least 

-a™ n l^^l ~ ^fi^)n^ > f -a™(2A:)-'* - mf{k)) = 6n^, 
^ i=i ^ 



10 



contradicting that there are less than 5n homomorphisms of H into G. So E occurs with probability 
less than 1/2. Hence, for at least 1/2 of the /i-tuples (fi, . . . ,Vh) € Vi x • • • x Vh, there is an edge 
of H such that the pair {vi,Vj) goes between parts of Qi and Qj with density less than a. This 
implies that for at least one edge of H, the pair {Vi,Vj) is (a, 2^)-shattered by Q. □ 

We will need the following lemma from [38] which tells us that if a pair of parts from P is shattered 
by Q then there is an increment in the mean-entropy density. Its proof is by a simple application of 
Jensen's inequality. 

Lemma 2.9 fl38^ . Lemma 7 on page 570) Let g : [0, 1] — t- M be the convex function given by q{0) = 
and q(x) = a; log x for x > 0. Let ei, . . . ,er and di, . . . ,dr be nonnegative real numbers with Yl\=i £j = 1 
and d = Yli=i ^i^i- Suppose /3 < 1 and L C [r] is such that di < f3d for i £ L and let s = X]ie/^«- 
Then 

r 

e,q{d,) > q{d) + (1 - /3 + g(/3))sd. 

j=l 

We are now ready to prove Theorem 11.41 in the following precise form. 

Theorem 2.1 Let H be a graph on {1, . . . ,h} with m edges. Let e > and be a tower of twos 
of height 8/i^ log e""*^. If G is a graph on n vertices in which at least en^ edges need to be removed to 
make it H-free, then G contains at least 5n^ copies of H. 

Proof: Suppose for contradiction that there is a graph G on n vertices in which at least en^ edges 
need to be removed from G to delete all copies of H, but G contains fewer than 5n^ copies of H. If 
n < 6^^^^, then the number of copies of H in G is less than 6n^ < 1, so G is H-free, contradicting that 
at least en^ edges need to be removed to make the graph if- free. Hence, n > 6~^^^. Note that the 
number of mappings from V{H) to V{G) which are not one-to-one is n'* — h\{^^ < h'^n^~^ < h'^d^^^n^. 
Let 5' = 2/i^J^/^, so the number of homomorphisms from H to G is at most 

The graph G contains at least en^ /m edge-disjoint copies of H. Let G' be the graph on the same 
vertex set which consists entirely of the at least er? jm edge-disjoint copies of H. Then d{G') > 
m ■ e/m = e and G' consists of '^^^ ^ n'^ edge-disjoint copies of H. We will show that there are at least 
homomorphisms from H to G' (and hence to G as well). For the rest of the argument, we will 
assume the underlying graph is G' . 

Let a = Apply Lemma [22] to G' with f{k) = -^a''^{2ky^ and 7 = ^|^. Note that r as in 
Lemma 12.61 is 

r = d{G')log{l/d{G'))/-f = 2/i^log(l/d(G')) < 2h^loge-\ 

Hence, we get a pair of equitable vertex partitions P and Q, with Q a refinement of P, q{Q) < q{P)-\-^, 
Q is weak /(|P|)-regular and \Q\ is at most a tower of twos of height 3r < 6/1^ loge~^. Let Vi, . . . , 
denote the parts of P and Qi denote the partition of Vi consisting of the parts of Q which are subsets 
ofVi. 

Suppose that {Va, Vb) is a pair of parts of P with edge density d = d{Va, V5) > e/m which is (a, 2^)- 
shattered by Q. Note that a < d/S. Arbitrarily order the pairs UixWi £ Qa^ Qb-, letting di = d{Ui, Wi) 
and ei = |y ^||^'| , so that the conditions of Lemma 12.91 with /3 = 1/8 are satisfied. Applying Lemma 
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12.91 we get, since q{l3) = — |log8 = — |, that 

q{Qa,Qb)-q{Va,Vi,) > {l-^ + qm^d{Va,Vb)\Vam\/n^ > ■^e{Va,Vk)/n\ 

Note that 

q{Q)-q{P)= iQiQa,Qb)-q{Va,Vb)), 

l<a,b<k 

which shows that q{Q) — q{P) is the sum of nonnegative summands. 

There are at most ^v?/2 edges of G' going between pairs of parts of P with density at most ^. Hence, 
at least 1/2 of the edge-disjoint copies of H making up G' have all its edges going between pairs of 
parts of P of density at least ^. By Lemma \T8l for each copy oi H, at least one of its edges goes 
between a pair of parts of P which is (a, 2^)-shattered by Q. Thus, 

where the sum is over all ordered pairs {Va,Vb) of parts of P which are (a, 2^)-shattered by Q and 
with d{Va,Vi,) > ^. This contradicts q{Q) < q{P) +7 and completes the proof. □ 



3 The induced removal lemma 

As in the last section, we will again discuss two different proofs of the induced removal lemma. 
Theorem 11.31 In Section 13. H we will discuss the proof of Alon, Fischer, Krivelevich and Szegedy 
[6], which uses their strong regularity lemma and gives a wowzer-type bound. In Section 13.21 we 
will examine the authors' recent proof [2jj of a tower-type bound. We will discuss Alon and Shapira's 
generalization of the induced removal lemma, which applies to infinite families of graphs, in Section [3^ 

3.1 The usual proof 

For an equitable partition P = {Vi\l < i < k} of V{G) and an equitable refinement Q = {Vij\l < i < 
k,l < j < i} of P, we say that Q is e-close to P if the following is satisfied. All 1 < i < i' < k but at 
most efc^ of them are such that, for all 1 < < I but at most of them, \diyi^ Fj')— < e 
holds. This notion roughly says that Q is an approximation of P . The strong regularity lemma of 
Alon, Fischer, Krivelevich and Szegedy [6j is now as follows. 

Lemma 3.1 (Strong regularity lemma) For every function / : N — t- (0, 1) there exists a number 
S = S{f) with the following property. For every graph G = {V,E), there is an equitable partition 
P of the vertex set V and an equitable refinement Q of P with \Q\ < S such that the partition P is 
f {1) -regular, the partition Q is f{\P\) -regular and Q is f{l)-close to P. 

That is, there is a regular partition P and a refinement Q such that Q is very regular and yet the 
densities between parts iVi^jiVi' of Q are usually close to the densities between the parts (Vi,Vi/) 
of P containing them. 

Let /(I) = e. Here, and throughout this section, we let q{x) = x^ be the square function as in the 
proof of Szemeredi's regularity lemma in the previous section. The condition that Q be e-close to P 
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is equivalent, up to a polynomial change in e, to q{Q) < q{P) + £• Indeed, if Q is e-close to P, then 
(l{Q) < (l{P) + 0{e), while if < q{P) + e, then Q is 0(e^/^)-close to P. A version of this statement 
is present in Lemma 3.7 of j6]. As it is sufficient and more convenient to work with mean-square 
density instead of e-closeness, we do so from now on. That is, we replace the third condition in the 
regularity lemma with the condition that q{Q) < q{P) + e. 

With this observation, the proof of the strong removal lemma becomes quite straightforward. Note 
that we may assume that / is a decreasing function by replacing it, if necessary, with the function given 
by /'(*) = iiiini<j<j /(j). We consider a series of partitions Pi,P2, . . . , where Pi is an /(l)-regular 
partition and Pj+i is an /(|Pj|) -regular refinement of the partition P^. Since Pj+i is a refinement 
of Pi we know that the mean-square density must have increased, that is, > q{Pi). If also 

(/(Pj+i) < g(Pj)+e then, since / is decreasing and Pj+i is /(|Pj|)-regular, we see that all three conditions 
of the theorem are satisfied with P = Pj and Q = Pj+i as the required partitions. Otherwise, we have 
q{Pi+i) > q{Pi) + e. However, since the mean-square density is bounded above by 1, this can happen 
at most times, concluding the proof. 

It is not hard to see why this proof results in wowzer-type bounds. At each step, we are applying 
the regularity lemma to find a partition Pj+i which is regular in the number of parts in the previous 
partition Pj. The bounds coming from the regularity lemma then imply that |Pj+i| = r(/(|Pi|)~*^'-^^). 
But this iterated tower-type bound is essentially how we define the wowzer function. 

That this is the correct behaviour for the bounds in the strong regularity lemma was proved indepen- 
dently by Conlon and Fox [24j and by Kalyanasundaram and Shapira |61j , though both proofs use 
slightly different ideas and result in slightly different bounds. With the function / : N — )• (0, 1) taken 
to be f{n) = e/n, the proof given in [24J shows that the number of parts in the smaller partition P 
may need to be as large as wowzer in a power of e~^, while that given in |61j proves that it must be 
at least wowzer in y^log e~^. 

The following easy corollary of the strong regularity lemma [6j is the key to proving the induced graph 
removal lemma. 

Lemma 3.2 For each < e < 1/3 and decreasing function / : N — >■ (0,1/3), there is 5' = 5'{e,f) 
such that every graph G = {V,E) with \V\ > 6'~^ has an equitable partition V = ViU . . .UVk and 
vertex subsets Wi C Vi such that \Wi\ > each pair {Wi,Wj) with l<i<j<kis f{k)-regular 

and all but at most ek"^ pairs I < i < j < k satisfy \d{Vi, Vj) — d{Wi, Wj)\ < e. 

In fact. Lemma 13.21 is a little bit stronger than the original version in [6] in that each set Wi is 
/(A;)-regular with itself The original version follows from the strong regularity lemma, applied with 

by taking the partition V = ViL) . . . UVk to be the partition P in the strong regularity lemma and 
the subset Wi to be a random part Vi^p C Vi of the refinement Q of P in the strong regularity lemma. 

Since f'{k) < ^(^^^) , it is straightforward to check that all pairs {Wi,Wj) are /(/c)-regular with 
probability greater than ^. Moreover, the expected number of pairs with 1 < i < j < k for which 

^This stronger version may be derived from an extra application of the regularity lemma within each of the pieces 
Wi, together with a suitable application of Ramsey's theorem. This is essentially the process carried out in [B], though 
they do not state their final result in the same form as Lemma [3.21 
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\d{Vi, Vj) - d{Wi, > e is at most 



4 V2y 4 V2y ~ 2 V2 

Here, the two | factors come from the definition of /(l)-closeness. The first factor comes from the 
fact that at most an |-fraction of the pairs (VijVj) do not have good approximations while the second 
factor comes from the fact that for ah other pairs there are at most an | fraction of pairs (Wi,Wj) 
which do not satisfy \d(yi,Vj) — d{Wi,Wj)\ < e. Therefore, by Markov's inequahty, the probabihty 
that the number of bad pairs is greater than 6(2) is less than ^. We therefore see that with positive 
probability there is a choice of Wi satisfying the required weaker version of Lemma 13.21 

If we assume the full strength of Lemma 13.21 as stated, that is, that each Wi is also /(A;)-regular 
with itself, it is easy to deduce the induced removal lemma. Let h = \V[H)\ and take f{k) = 
If there is a mapping (/> : V{H) — {1, . . . , A;} such that for all adjacent vertices w of the edge 
density between W^^^^ and W^^^-^ is at least e and for all distinct nonadjacent vertices f , w of the 
edge density between W^f^^-^ and is at most 1 — e, then the following standard counting lemma 

(see, e.g.. Lemma 3.2 in Alon, Fischer, Krivelevich and Szegedy ^ for a minor variant) shows that G 
contains at least 5n^ induced copies of H, where ^ = As with Lemma [231 the standard 

proof of this counting lemma uses a greedy embedding strategy. 



Lemma 3.3 If H is a graph with vertices 1, . . . ,/i and G is a graph with not necessarily disjoint 
vertex subsets Wi, . . . , Wh such that every pair (Wi, Wj) with l<i<j<his j -regular with 7 < 
\Wi\ > 7""'^ for 1 < i < h and, for I < i < j < k, d{Wi,Wj) > r] if {i,j) is an edge of H and 

d{yVi,Wj) < 1 — T] otherwise, then G contains at least (j) x ••• x \Wh\ induced copies of H 

with the copy of vertex i in Wi . 



Hence, we may assume that there is no such mapping (p. We then delete the edges between Vi and 
Vj if the edge density between Wi and Wj is less than e and add the edges between Vi and Vj if the 
density between Wi and Wj is more than 1 — e. The total number of edges added or removed is at 
most hev? and no induced copy of H remains. Replacing e by e/8 in the above argument gives the 
induced removal lemma. 



3.2 An improved bound 

The main goal of this section is to prove Theorem 11.51 which gives a bound on which is a tower 
in h of height polynomial in e~^. We in fact prove the key corollary of the strong regularity lemma. 
Lemma |3.2| with a tower- type bound. This is sufficient to prove the desired tower- type bound for the 
induced graph removal lemma. 

As in Section 12.21 the key idea will be to take a weak variant of Szemeredi's regularity lemma and 
iterate it. The particular variant we will use, due to Duke, Lefmann and Rodl [28], was originally used 
by them to derive a fast approximation algorithm for the number of copies of a fixed graph in a large 
graph. 

A k-cylinder (or cylinder for short) in a graph G is a product of k vertex subsets. Given a /c-partite 
graph G = {V, E) with /c-partition 1/ = Vi U . . . U V^, we will consider a partition fC of the cylinder 
Vi X • • • X Vfc into cyhnders = l^i x • • • x Wk, Wj C for i = 1, . . . , A: and we let Vi{K) = Wi. 
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We say that a cylinder is e-regular if all (2) pairs of subsets {Wi, Wj), I < i < j < k, are e-regular. 
The partition /C is e-regular if all but an e- fraction of the /c-tuples (vi, . . . , v^) G Vi x ■ ■ ■ x are in 
e-regular cylinders in the partition /C. 

The weak regularity lemma of Duke, Lefmann and Rodl [2H] is now as follows. Note that, like the 
Frieze-Kannan weak regularity lemma, it has only a single-exponential bound on the number of parts. 
We will sometimes refer to this lemma as the cylinder regularity lemma. 

Lemma 3.4 Let < e < 1/2 and /3 = ^(e) = e''''~\ Suppose G = {V,E) is a k-partite graph with 
k-partition V = ViU . . .UVk- Then there exists an e-regular partition K, 0/ Vi x • • • x into at most 
I3~^ parts such that, for each K £ IC and 1 < i < k, \Vi{K)\ > /3|T^|. 

We would now like to iterate this lemma to get a stronger version, the strong cylinder regularity 
lemma. Like Lemmas 12.61 and 13. H this will yield two closely related cylinder partitions P and Q with 
P regular and Q regular in a function of |P|. To state the lemma, we first strengthen the definition of 
regular cylinders so that pieces are also regular with themselves. 

A /c-cylinder Wi x ■ ■ ■ x Wk is strongly e-regular if all pairs (Wj, Wj) with 1 < i, j < k are e-regular. A 
partition /C of Vi x • • • x into cylinders is strongly e-regular if all but e| Vi| x • • • x | Va,.| of the A;-tuples 
(vi, . . . ,Vk) G Vi X ■ ■ ■ X Vk are contained in strongly e-regular cylinders K £ IC. 

We now state the strong cylinder regularity lemma. Here ti{x) is a variant of the tower function 
defined by tQ{x) = x and = 2***^^^. Also, given a cylinder partition /C, Q{IC) is the coarsest 

vertex partition such that every set Vi{K) with i G [k] and K £ IC is the union of parts of Q{IC). 

Lemma 3.5 For < e < 1/3, positive integer s, and decreasing function / : N — t- (0, e], there is S = 
S{e, s, f) such that the following holds. For every graph G, there is an integer s < k < S , an equitable 
partition P : V = Vi L) . . . L) Vk and a strongly f{k) -regular partition /C of the cylinder Vi x ■ ■ ■ x 
into cylinders satisfying that the partition Q = Q{IC) ofV has at most S parts and q{Q) < qiP) + e. 
Furthermore, there is an absolute constant c such that letting si = s and Sj+i = t4^{{si/ f{si)Y), we 
may take S = Si with £ = 2e~^ + 1. 

To prove this lemma, we need to find a way to guarantee that the parts of the cylinder partition 
are regular with themselves as required in the definition of strong cylinder regularity. For a graph 
G = {V,E), a vertex subset U C V is e-regular if the pair (U, U) is e-regular. The following lemma, 
which demonstrates that any graph contains a large vertex subset which is e-regular, is the first step. 

Lemma 3.6 For each < e < 1/2, let 5 = 5{e) = 2"*^ . Every graph G = {V,E) contains an 

e-regular vertex subset U with \U\ > S\V\. 

One way to prove this lemma is to first find a large collection C of disjoint subsets of equal order which 
are pairwise a-regular with a = (e/3)^. This can be done by an application of Szemeredi's regularity 
lemma and Turan's theorem, but then the bounds are quite weak. Instead, one can easily deduce this 
from Lemma 13.41 A further application of Ramsey's theorem allows one to get a subcollection C' of 
size s > 2a~^ such that the edge density between each pair of distinct subsets in C' lies in an interval 
of length at most a. The union of the sets in C' is then an e-regular subset of the desired order. 

It is crucial in this lemma that be of bounded tower height in e~^. While our bound gives a double 
exponential dependence, we suspect that the truth is more likely to be a single exponential. We leave 
this as an open problem. 
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Repeated applications of Lemma \3M allow us to pull out large, regular subsets until a small fraction of 
vertices remain. By distributing the remaining vertices amongst these subsets, we only slightly weaken 
their regularity, while giving a partition of any graph into large parts each of which is e-regular with 
itself. This will be sufficient for our purposes. 

Lemma 3.7 For each < e < 1/2, let 6 = 6{e) = 2~'^ . Every graph G = {V,E) has a vertex 

partition V = Vi^J . . .UVk such that for each i , 1 < i < k , \Vi\ > 5\V\ and Vi is an e-regular set. 

We are now ready to prove the strong cylinder regularity lemma. 

Proof of Lemma 13. 5t We may assume \V\ > S, as otherwise we can let P and Q be the trivial 
partitions into singletons, and it is easy to see the lemma holds. We will define a sequence of partitions 
Pi, P2, ... of equitable partitions, with Pj+i a refinement of Pj and q{Pj+i) > q{Pj) + e/2. Let Pi be 
an arbitrary equitable partition of V consisting of si = s parts. Suppose we have already found an 
equitable partition Pj : V = Vi U . . . U with k < Sj. 

Let f3{x,t) = x^'^^ ^ as in Lemma 13.41 and 6{x) = (^"Z"") Lemma 13.71 We apply Lemma 13.71 

to each part Vi of the partition Pj to get a partition of each part Vi = VnU . . .U Vifn of Pj into parts 
each of cardinality at least 6\Vi\, where 6 = 5(7) and 7 = f{k) ■ /3 with /3 = /3{f{k), k), such that each 
part Vih is 7-regular. Note that is at most triple-exponential in a polynomial in k/ f{k). For each 
/c-tuple I = (£1, . . . ,tk) G [^1] X • • • X [hk], by Lemma [3^ there is an /(A:)-regular partition K.^ of the 
cylinder Vu^ x • • • x Vki^ into at most j3~^ cylinders such that, for each K G /C^, |Vi4(i^)| > /3|Vi£j. 
The union of the fCi forms a partition /C of Vi x • • • x Vfc which is strongly /(/i;)-regular. 

Recall that Q = Q{JC) is the partition of V which is the common refinement of all parts Vi{K) with 
i G [k] and K ^ K,. The number of parts of K, is at most fi~^ and hence the number of parts of Q 
is at most k2^/^^ Thus, the number of parts of Q is at most quadruple-exponential in a polynomial 
in k/f{k). Let Pj+i be an equitable partition into 4e~^|(5| parts with g(Pj_|_i) > q{Q) — |, which exists 
by Lemma 12.31 Hence, there is an absolute constant c such that 

\Pj+i\<ui{k/fik)y)<s,+i. 

If q{Q) < q{Pj) + e, then we may take P = Pj and Q = Q{JC), and these partitions satisfy the desired 
properties. Otherwise, q{Pj+i) > q{Q) — f > Q{Pj) + f > and we continue the sequence of partitions. 
Since q{Pi) > and the mean-square density goes up by more than e/2 at each step and is always at 
most 1, this process must stop within 2/e steps, and we obtain the desired partitions. □ 

Let G = (y,E), P : V = Vi U . . . U be an equipartition and /C be a partition of the cylinder 
1^1 X • • • X Vfc into cyhnders. For K = Wi x ■ ■ ■ x Wk e JC, define the density d{K) = ^^]'^'.'.'.l\v^f ■ The 
cylinder K is e-close to P if \d{Wi,Wj) — d{yi,Vj)\ < e for all but at most ek^ pairs 1 < i ^ j < k. 
The cylinder partition /C is e-close to P if ^ d{K) < e, where the sum is over eill K G IC that are not 
e-close to P. As with the definition of closeness used in the strong regularity lemma, this definition is 
closely related to the condition that q{Q) < q{P) + e, where here Q = Q(/C). 
The connection we shall need to prove Lemma 13.21 is contained in the following statement. 

Lemma 3.8 Let G = (V,E) and P : V = Vi U . . . U Vk be an equipartition with k > 2e^^ and 
\V\ > 4A;e~^. Let JC be a partition of the cylinder Vi x • • • x into cylinders. If Q = Q{JC) satisfies 
q{Q) < q{P) + e, then K is (2e)i/^-dose to P. 
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Proof: It will be helpful to assume that all parts of the equipartition P have equal size - this affects 
the calculations only slightly. It will also be helpful to introduce a slight variant of the mean-square 
density as follows. Let q'{P) = Ei<j dHVi,Vj)Pij, where pij = \Vi\\Vj\/ ^^^j \Vi\\Vj\. Thus, q'{P) is 
the mean of the square densities between the pairs of distinct parts. It is easy to check that q'{P) is 
close to q{P). Indeed, we have q'{P) — q{P) = \ {q'{P) — q), where q = Yli=i is the average of 

the square densities inside the parts. Hence, \q'{P) — q{P)\ < p We similarly have \q'{Q) — qiQ)\ < p 

i<j K&K 
i<j K&fC 

J2 E {dmK),Vj{K))-d{V,,V,)fd{K), 

i<j KeK. 

where the last equality uses the identity diVi, Vj) = Y1,k<^k d{yi{K)^ Vj{K))d{K). This equality shows 
that q{lC) > q'{P) as it expresses their difference as a sum of nonnegative terms. Furthermore, it 
shows that if K. is not /3-close to P, then q{lC) > q{P) + (2)'^ • ^ • Z?^ • /3 > q{P) + P^. In particular, 
if g(/C) < q'{P) + 2e, then /C is (2e)^/4-close to P. So assume for contradiction that q{JC) > q'{P) +2e. 

A similar equality implies q'{Q) > q{IC). We therefore have 

qiQ)-q{P) = {qiQ)-q\Q)) + {q\Q)-q{JC)) + {q{IC)-q'iP)) + {q'iP)-qiP)) 

> "^ + 0+(g(/C)-,'(P))-i 

> e, 

contradicting the assumption of Lemma 13.81 and completing the proof. □ 
With this in hand, we can readily deduce a tower-type bound for Lemma |3.2[ 

Lemma 3.9 For each < e < 1/3 and decreasing function / : N — ?• (0,e], there is 5' = 6'{e,f) such 
that every graph G = {V,E) with \V\ > 6'~^ has an equitable partition V = ViU . . .UVk and vertex 
subsets Wi C Vi such that \Wi\ > S'\V\, each pair {Wi,Wj) with 1 < i < j < k is f{k)-regular and 
all but at most ek"^ pairs 1 < i < j < k satisfy \d{Vi,Vj) — d{Wi,Wj)\ < e. Furthermore, we may take 
6' = where S = S{y, s, f) is defined as in Lemma \3.5\ and s = 2e~^. 

Proof: Let a = y, s = 2e~^, and 6' = where S = S{a,s,f) is as in Lemma [331 We apply 
Lemma [331 with a in place of e. We get an equipartition P : V = Vi U . . . U Vk with s < k < S and a 
strongly /(A;)-regular partition /C of 14 x • • • x I4 into cylinders such that the refinement Q = Q{fC) 
of P has at most S = S{a,s,f) parts and satisfies q{Q) < q{P) + a. Since \V\ > 6'^^ = 85^, and 
P is an equipartition into k < S parts, the cardinality of each part Vi G P satisfies \Vi\ > -^jr. By 
Lemma 13.81 (2q)^/^ = e, the cylinder partition /C is e-close to P. Hence, at most an e-fraction of 
the A:-tuples {vi, . . . , v^) G ^1 x • • • x belong to parts K = Wi x • • • x Wk of K, that are not e-close 
to P. Since Q(/C) has at most S parts, the fraction of fc-tuples (fi, . . . , Vk) G Fl x • • • x that belong 




We have the following equalities 
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to parts K = Wi x • • • x of K, with \Wi\ < jg\Vi\ for at least one i E [k] is at most jg ■ S = j. 
Therefore, at least a fraction I — f{k) — e — j > of the A;-tuples [vi, . . . ,Vk) ^ Vi x ■ ■ ■ x Vk belong to 
parts K = Wi X ■ ■ ■ X Wk of /C satisfying K is strongly /(/c)-regular, \Wi\ > jg\Vi\ > S'\V\ for i G [k] 
and K is e-close to P. Since a positive fraction of the A;-tuples belong to such K, there is at least one 
such K. This K has the desired properties. Indeed, the number of pairs 1 < i ^ j < k for which 
\d{Wi,Wj) — d{Vi,Vj)\ > e is at most e/c^ and hence the number of pairs 1 < i < j < k ior which 
\d{Wi, Wj) - d{Vi, Vj)\ > e is at most ek"^ /2 + k< ek"^. This completes the proof. □ 

By using the induced counting lemma. Lemma 13.31 we may now conclude the proof as in Section 13.11 
to obtain the following quantitative version of Theorem 11.31 

Theorem 3.1 There exists a constant c such that, for any graph H on h vertices and < e < 1/2, 
if 5^^ = tj{h), where j = ce~^, then any graph G on n vertices with at most dn^ induced copies of H 
may be made induced H-free by adding and/or deleting at most en^ edges. 



3.3 Infinite removal lemma 

In order to characterize the natural graph properties which are testable, the induced removal lemma 
was extended by Alon and Shapira [12] to the following infinite version. For a family % of graphs, a 
graph G is induced Ti-iree if G does not contain any graph H vaU.. 



Theorem 3.2 For every (possibly infinite) family of graphs % and e > 0, there are uq, Hq, and 6 such 
that the following holds. If a graph G = {V, E) on n > hq vertices has at most 6n^ induced copies of 
each graph H €z H on h < Hq vertices, then G can be made induced T-L-free by adding and/ or deleting 
at most en^ edges. 



Proof: The proof is a natural extension of the proof of the induced removal lemma and similarly 
uses the key corollary. Lemma 13.21 of the strong regularity lemma. The main new idea is to pick an 
appropriate function / to apply Lemma 13.21 The choice of the function / will depend heavily on the 
family %. 

For a graph H and an edge-coloring c of the edges of the complete graph with loops R on [k] with 
colors white, black and grey, we write H — )-c R if there is a mapping <j) : V{H) — )• [k] such that for each 
edge (u, v) of H we have that c{4>{u), 4>{v)) is black or grey and for each pair (u, v) of distinct vertices 
of H which do not form an edge we have that c{(p{u) , (p{v)) is white or grey. We write H -f^^ R if 
K — s-c R does not hold. 

Let P : y = 14 U . . . U Vfc be a vertex partition of G. A key observation is that if we round G by the 
partition P and the coloring c to obtain a graph G' on the same vertex set as G by adding edges to 
make {Vi, Vj) complete if (i, j) is black, deleting edges to make (Vj, Vj) empty if {i,j) is white and we 
have that H -/^c R, then G' does not contain H as an induced subgraph. 

For any (possibly infinite) family of graphs Ti and any integer r, let Tir be the following set of colored 
complete graphs with loops: a colored complete graph with loops R belongs to Tir if and only if it has 
at most r vertices and there is at least one H £ T-L such that H — )-c R. For any family Ti of graphs 
and integer r for which Tir 7^ 0, let 

^-ij(r) = max min 
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If Tir = 0, define ^y^^r) = 1. Note that ^^^{r) is a monotonically increasing function of r. Let 



fir) 



Note that the function / only depends on e and H. 

Let 6' = 5'{e, f) be as in Lemma [3^ which only depends on e and T-L. Also let = 2(5'~^, /iq = ^'-^(A^o), 
no = 1/ {5'f{ko)) and 5 = ^(e/4)'*0(5''^o. We have that ko, ho, uq and 5 > only depend on e and ^. 
By assumption, G has n > uq vertices. 

We apply Lemma [32] to G. We get an equitable vertex partition P : V = ViU . . .UVk of G and subsets 
Wi C Vi with \Wi\ > S'\V\ such that, for 1 < i < j < fc, the pair is /(/c)-regular and all but 

at most pairs l<i<j<k satisfy |d(Vi, V,) - d(Wi, Wj)\ < e. As < |Wi| < \Vi\ < 2n/k, we 
have k < 26''^ < ko. 

Consider the coloring c of the complete graph with loops R on [k] where a pair of vertices is black 
if d(Wi, Wj) > 1 — e, white if d{Wi, Wj) < e and grey if e < d{Wi, Wj) < 1 — e. Suppose, for the sake 
of contradiction, that there is a graph H with H — t-c R. From the definition of ^, there is a graph H 
on h < ^■}{{k) vertices with H —t-c R- As k < ko, the number of vertices of H satisfies h < ho- As 
each pair {Wi,Wj) is /(/c)-regular and |Wj| > 5'\V\ > f{k)~^, applying the induced counting lemma. 
Lemma [33] with 7 = f{k), we get at least 



induced copies of H in G, contradicting the supposition of the theorem. Thus, there is no graph H 
with H —T-c R. 

We round the graph G by the partition P and the coloring c as described earlier in the proof to obtain 
a graph G' . By the key observation, for each graph H with H -f^^ R, the graph G' does not contain 
H as an induced subgraph. Hence, G' is induced ^-free. 

Moreover, not many edges were changed from G to obtain G' . Indeed, as there are at most ek'^ pairs 
1 < ^ < j < which satisfy \d{Vi,Vj) — d{Wi,Wj)\ > e, the number of edge modifications made 
between such pairs is at most ek'^ ■ {2n/k)'^ = Aen? . Between the other pairs we have made at most 
SeQ) ^ e'^'^ edge modifications. In total, at most Sen-^ edge modifications were made to obtain G' 
from G. Replacing e by e/5 in the above argument completes the proof. □ 

4 Arithmetic removal 

The notion of arithmetic removal was introduced by Green |57] . By establishing an appropriate variant 
of the regularity lemma in the context of abelian groups, he proved the following result. 

Theorem 4.1 For any natural number k >3 and any e > 0, there exists 6 > such that if G is an 
abelian group of order n and Ai, . . . , are subsets of G such that there are at most 5n^~^ solutions to 
the equation oi + 02 + • • • + afc = with ai G Ai for all i then it is possible to remove at most en elements 
from each set Ai to form sets A'- so that there are no solutions to the equation a'^ + 02 + • • • + a'^ = 
with a'j G A'- for all i. 
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It is an exercise to show that Green's result imphes Roth's theorem. While Green's proof of this result 
relied on Fourier analytic techniques, an alternative proof was found by Krai', Serra, and Vena j7U| . 
who showed that the following more general result follows from an elegant reduction to the removal 
lemma in directed graphs. 

Theorem 4.2 For any natural number k > 3 and any e > 0, there exists 5 > such that if G is a 
group of order n, g G G and Ai, . . . , Aj. are subsets of G such that there are at most 6n^~^ solutions 
to the equation 0102 ■ ■ ■ = g with a, G Ai for all i then it is possible to remove at most en elements 
from each set Ai to form sets A[ so that there are no solutions to the equation a'j^a'g • • • a'j^ = g with 
a[ G A'^ for all i. 

This is stronger than Theorem 14. II in two ways. Firstly, it applies to all groups and not just to abelian 
groups. Secondly, it applies to non-homogeneous equations, that is, 0102 • • • = 5 for a general 5, 
whereas Green only treats the homogeneous case where g = 1. To give some idea of their proof, we 
will need the following definition. 

A directed graph is a graph where each edge has been given a direction. Formally, the edge set may be 
thought of as a collection of ordered pairs. We will always assume that the directed graph has no loops 
and does not contain parallel directed edges, though we do allow anti-parallel edges, that is, both the 
edge uv and the edge vu. The following analogue of the graph removal lemma for directed graphs was 
proved by Alon and Shapira [9j as part of their study of property testing in directed graphs. 

Theorem 4.3 For any directed graph H and any e > 0, there exists (5 > such that any directed 
graph on n vertices which contains at most Sn"^^^ copies of H may be made H-free by removing at 
most en^ edges. 

We will show how to prove Theorem 14.21 with g = 1 using Theorem 14.31 Suppose that G is a group of 
order n and Ai, . . . , A^ are subsets of G such that there are at most (5n'^~^ solutions to the equation 
aia2 ■ ■ ■ CLk = 1 with aj G Ai for all i. Consider the auxiliary directed graph F whose vertex set is 
G X {1, 2, ... , k}. We place an edge from (x, i) to (y, i + 1), where addition is taken modulo k, if there 
exists ai £ Ai such that xoi = y. It is easy to see that any directed cycle in F corresponds to a solution 
of the equation aia2 - ■ ■ a^ = 1. Moreover, every such solution will result in n different directed cycles 
in F, namely, those with vertices {x, 1), {xai, 2), {xaia2, 3), . . . , {xai ■ ■ ■ afc_i, k). 

Since G has at most Sn^~^ solutions to aia2 • • • Ofc = 1, this implies that there are at most 6n^ directed 
cycles in F. By Theorem l4.31 for an appropriately chosen 6, we may therefore remove at most edges 
to make it free of directed cycles of length k. In ^dj, we now remove the element Oj if at least ^ edges of 
the form (x, i){xai,i + l) have been removed. Note that this results in us removing at most en elements 
from each Ai. Suppose now that the remaining sets A'^ are such that there is a solution 0^02 . . . a'^ = 1 
with a'i G A[ for all i. Then, as above, there are at least n cycles [x, 1), (xa'^, 2), . . . , {xa'i ■ ■ • a'j^_^, k) 
corresponding to this solution. Since we must have removed one edge from each of these cycles, we 
must have removed at least ^ edges of the form {y,i){ya'i,i + 1) for some i. But this implies that 
a'- ^ A'^, yielding the required contradiction. 

It was observed by Fox [38] that in Theorem 14.31 mav. like the graph removal lemma, be taken to 
be at most a tower of twos of height logarithmic in e~^. This may in turn be used to give a similar 
bound for 6^^ in Theorem 14.21 

In [To], Krai', Serra and Vena also showed how to prove a removal lemma for systems of equations 
which are graph representable, in the sense that they can be put in a natural correspondence with a 
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directed graph. An example of such a system is 

XlX2X^^X^^ = 1 
XlX2X'^^ = 1. 

This idea of associating a system of hnear equations with a directed graph representation was extended 
to hypergraphs independently by Krai', Serra and Vena [71] and by Shapira [11)^111)1] in order to prove 
the following theorem (some partial results had been obtained earlier by Krai', Serra and Vena [69j . 
Szegedy [l08] and Candela [20]). 

Theorem 4.4 For any natural numbers k and i and any e > 0, there exists 5 > such that if F is 
the field of size n, M is an £ x k matrix with coefficients in F, b € F^ and Ai,. . . ,Ak are subsets 
of F such that there are at most 5n^~^ solutions a = (oi, . . . ,ak) of the system Ma = b then it is 
possible to remove at most en elements from each set Ai to form sets A'- so that there are no solutions 
a' = {a'l, . . . , a^) to the equation Ma' = b with a'- £ A'- for all i. 

An easy application of this result shows that a removal lemma for systems of linear equations holds in 
the set [n], confirming a conjecture of Green [57]. We remark that this result easily implies Szemeredi's 
theorem. Both proofs use a colored variant of the hypergraph removal lemma due to Austin and Tao 
|13j . though the representations which they use to transfer the problem to hypergraphs are different. 

It would be interesting to know whether an analogous statement holds for all groups. A partial 
extension of these results to abelian groups is proved in [72] (see also [108j ) but already in this case 
there are technical difficulties which do not arise for finite fields. 

5 Sparse removal 

Given graphs T and H, let N}{{T) be the number of copies of if in F. A possible generalization of the 
graph removal lemma, which corresponds to the case T = Kn, could state that if G is a subgraph of T 
with Nh{G) < 6Nh{T) then G may be made H-free by deleting at most ee{T) edges. Unfortunately, 
this is too much to hope in general. However, if the graph T is sufficiently well-behaved, such an 
extension does hold. We will discuss two such results here. 

5.1 Removal in random graphs 

The binomial random graph Gn,p is formed by taking n vertices and considering each pair of vertices 
in turn, choosing each connecting edge to be in the graph independently with probability p. These 
graphs were introduced by Erdos and Renyi [33\ [M] in the late fiftieq^ and their study has grown 
enormously since then (see, for example, the monographs |17[ I60j ). 

Usually, one is interested in finding a threshold function p* := p*[n) where the probability that the 
random graph Gn,p has a particular property V changes from o(l) to 1 — o(l) as we pass from random 
graphs chosen with probability p p* to those chosen with probability p ^ p* . For example, the 
threshold for the random graph to be connected is at p*{n) = 

'^^The notion was also introduced independently by several other authors at about the same time but, quoting BoUobas 
[17] . "Erdos and Renyi introduced the methods which underlie the probabilistic treatment of random graphs. The other 
authors were all concerned with enumeration problems and their techniques were essentially deterministic." 
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One theme that has received a lot of attention in recent years is the question of determining thresholds 
for the appearance of certain combinatorial properties. One well-studied example is the Ramsey 
property. Given a graph H and a natural number r > 2, we say that a graph G is {H,r)-Ramsey if 
in any r-coloring of the edges of G there is guaranteed to be a monochromatic copy of H. Ramsey's 
theorem [82j is itself the statement that Kn is (H, r)-Ramsey for n sufficiently large. The following 
celebrated result of Rodl and Rucihski [831 Ell from 1995 (see also [60j, Chapter 8) determines the 
threshold for the appearance of the Ramsey property in random graphs. 



Theorem 5.1 For any graph H that is not a forest consisting of stars and paths of length 3 and every 
positive integer r >2, there exist constants c, C > such that 



lim F(Gnp is {H,r)- Ramsey) 

where 



'O, if p < cn-^/"^^'^^\ 
1, ifp>Cn-^l'^^^^\ 



m2(H) = max 14^-^ ■ H' Q H and v(H') > 3 1 . 

[v{H')-2 J 

The threshold occurs at the largest value of p* such that there is some subgraph H' of H for which the 
number of copies of H' is approximately the same as the number of edges. For p significantly smaller 
than p* , the number of copies of H' will also be significantly smaller than the number of edges. This 
property allows us (by a rather long and difficult argument [83]) to show that the edges of the graph 
may be colored in such a way as to avoid any monochromatic copies of H' . For p significantly larger 
than p* , every edge of the random graph is contained in many copies of every subgraph of H. The 
intuition, which takes substantial effort to make rigorous ^84]; is that these overlaps are enough to 
force the graph to be Ramsey. 

Many related questions were studied in the late nineties. In particular, people were interested in 
determining the threshold for the following Turan property. Given a graph H and a real number 
e > 0, we say that a graph G is (H, e)- Turan if every subgraph of G with at least 

edges contains a copy of H. The classical Erdos-Stone-Simonovits theorem [351 EH I115j states that 
the graph Kn is {H, e)-Turan for n sufficiently large. Resolving a conjecture of Haxell, Kohayakawa, 
Luczak and Rodl [581 [65], Conlon and Gowers [25] and, independently, Schacht [98] proved the following 
theorem. It is worth noting that the result of Conlon and Gowers applies in the strictly balanced case, 
that is, when m2{H') < m2{H) for all H' C H, while Schacht's result applies to all graphs. However, 
the class of strictly balanced graphs includes most of the graphs one would naturally consider, such 
as cliques or cycles. 



Theorem 5.2 For any graph and any e > 0, there exist positive constants c and G such that 

lim ¥[Gnp is {H, e)- Turan) 



0, if p < cn-^l'^^^^\ 

1, if p > Gn~^/"'^^^\ 



'Note that if H = K2, we take m2{H) = i. 
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The results of [25] and |98) (see also [33j) allow one to prove thresholds for the appearance of many 
different combinatorial properties. For example, the results extend without difficulty to prove ana- 
logues of Theorems 15. II and 15.21 for hypergraphs. The results also apply to give thresholds in different 
contexts - one example is an extension of Szemeredi's theorem to random subsets of the integers. 

Perhaps surprisingly, the methods used in [25] and [98] are very different and have different strengths 
and weaknesses. We have already mentioned that Schacht's results applied to all graphs while the 
results of Conlon and Gowers only applied to strictly balanced graphs. On the other hand, the results 
of [25] also allowed one to transfer structural statements to the sparse setting, including the stability 
version of the Erdos-Stone-Simonovits theorem [103j and the graph removal lemma. More recently, 
Samotij [SI] modified Schacht's method to extend this sparse stability theorem to all graphs. The 
result is the following theorem. 

Theorem 5.3 For any graph H and any e > 0, there exist positive constants 5 and C such that if 
p > Cn^^^™"^^^^ then the following holds a.a.s. in Gn,p- Every H-free subgraph of Gn,p with at least 

~ x{H)-i ~ ^(2) ^^9^^ made {x{H) — l)-partite by deleting at most epn^ edges. 

Recently, a third method was developed by Balogh, Morris and Samotij [T5] and, simultaneously and 
independently, by Saxton and Thomason ^97j for proving sparse random analogues of combinatorial 
theorems. One of the results of their research is a proof of the KLR conjecture of Kohayakawa, 
Luczak and Rodl [BS]- This is a technical statement which allows one to prove an embedding lemma 
complementing the sparse regularity lemma of Kohayakawa [63] and Rodl. A variant of this conjecture 
has also been proved by Conlon, Gowers, Samotij and Schacht [26] using the methods of |25l|98]. One 
of the applications of this latter result is the following sparse random analogue of the graph removal 
lemma (this was already proved for triangles in [65] and for strictly balanced graphs in |25|). 

Theorem 5.4 For any graph H and any e > 0, there exist positive constants 5 and C such that if 
p > Cn~^/'^^^^^ then the following holds a.a.s. in Gn,p. Every subgraph of Gn,p which contains at 
most Sp^^^^n"^^^ copies of H may be made H-free by removing at most epn^ edges. 

Note that for any e there exists a positive constant c such that if p < cn"^/™^^^), the removal lemma 
is trivial. This is because, for c sufficiently small, the number of copies of the densest subgraph H' of 
H will a.a.s. be smaller than epn^. Theorem 15.41 shows that it also holds for p > Cn~^^^^^^\ This 
leaves a small intermediate range of p where it might also be expected that a sparse removal lemma 
a.a.s. holds. That this is so was conjectured by Luczak [76] . 

For balanced graphs H, we may close the gap by letting 5 be sufficiently small depending on G, e and H. 
Indeed, as p < Cn~^^'^^^^\ the number of copies of H is a.a.s. on the order oi p^^^^ n""^^^ < C^^^^pn^ . 
Therefore, taking 6 < eG~^^^\ we see that the number of copies of H is a.a.s. less than epn'^. Deleting 
one edge from each copy of H in the graph then makes it H-free. 

A sparse random analogue of the hypergraph removal lemma was proved in [25] when T-L = Kf^_l_^. This 
result also extends to cover all strictly balanced hypergraphs0 It would be interesting to extend this 
result to all hypergraphs. 

It is worth noting that the sparse random version of the triangle removal lemma does not imply a 
sparse random version of Roth's theorem. This is because the reduction which allows us to pass from 

^^We note that for fc-uniform hypergraphs the relevant function is mk(H) = max | ^(-H'j-fc | ' where the maximum is 
taken over all subgraphs T-L' of H. with at least fc + 1 vertices. 
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a subset of the integers with no arithmetic progressions of length 3 to a graph containing few triangles 
gives us a graph with dependencies between its edges. This issue does not occur with pseudorandom 
graphs, which we discuss in the next section. 

5.2 Removal in pseudorandom graphs 

Though there have long been explicit examples of graphs which behave like the random graph Gn,pi 
the first systematic study of what it means for a given graph to be like a random graph was initiated 
by Thomason [llS j 1114] . Following himl^ we say that a graph on vertex set V is {p^ f3) -jumbled if, for 
all vertex subsets X,Y C.V, 

\e{X,Y) -p\X\\Y\\ < f3y^\X\\Y\. 

The random graph Gn,p is, with high probability, (p, /3) -jumbled with /3 = 0{^/pn). This is also 
optimal in that a graph on n vertices with p < 1/2 cannot be (p, /3)-jumbled with /3 = o{y/pn). The 
Paley graph is an example of an explicit graph which is optimally jumbled. This graph has vertex set 
Zp, where p = l(mod 4) is prime, and edge set given by connecting x and y if their difference is a 
quadratic residue. It is (p, /3)-jumbled with p = \ and (5 = 0{y/n). Many more examples are given in 
the excellent survey [73] . 

A fundamental result of Chung, Graham and Wilson |22| states that for graphs of density p, where p 
is a fixed positive constant, the property of being (p, o(re))-jumbled is equivalent to a number of other 
properties that one would typically expect in a random graph. For example, if the number of cycles 
of length 4 is as one would expect in a binomial random graph then, surprisingly, this is enough to 
imply that the edges are very well-spread. 

For sparser graphs, the equivalences are less clear cut, but the notion of jumbledness defined above is 
a natural property to study. Given a graph property V that one would expect of a random graph, one 
can ask for the range of p and (3 for which a (p, /3)-jumbled graph satisfies V. 

To give an example, it is known that there is a constant c such that if /3 < cp^n then any (p, /3)-jumbled 
graph contains a triangle. It is also known that this is sharp, since an example of Alon [2J gives a 
triangle-free graph with p = which is optimally jumbled, so that /3 = 0{^/pn) = 0{p^n). 

As in the previous section, one can ask for conditions on p and (3 which guarantee that a (p, /3)-jumbled 
graph satisfies certain combinatorial properties. For the property of being (i^s, e)-Turan, this question 
was addressed by Sudakov, Szabo and Vu |107j (see also [21]), who showed that it was enough that 
/3 < cp^n for an appropriate c. This is clearly sharp, since for larger values of /3 we cannot even 
guarantee that the graph contains a triangle. More generally, they proved the following theoremQ 

Theorem 5.5 For any natural number t > 3 and any e > 0, there exists c > such that if /3 < cp^^^n 
then any (p, 13) -jumbled graph is {Kt,€)-Turdn. 

Except in the case of triangles, there are no known constructions which demonstrate that this theorem 
is tight. However, it is conjectured [107] that the bound on /3 in Theorem 15.51 is the correct condition 
for finding copies of Kt in a {p, /3)-jumbled graph. This would in turn imply that Theorem 15.51 is tight. 

^^Strictly speaking, Thomason considered a slightly different notion, namely, that |e(X)— p('^') | < I3\X\ for all X (1 V, 
but the two are closely related. 

^^Their results were only stated for the special class of (p, /3)-jumbled graphs known as (n, d, A)-graphs. These are 
graphs on n vertices which are d-regular and such that all eigenvalues of the adjacency matrix, save the largest, have 
absolute value at most A. The expander mixing lemma implies that these graphs are {p, /3)-jumbled with p ~ ^ and 
/3 — X. However, it is not hard to verify that their method applies in the more general case. 
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For the triangle removal lemma, the following pseudorandom analogue was recently proved by Ko- 
hayakawa, Rodl, Schacht and Skokan |68| . 

Theorem 5.6 For any e > 0, there exist positive constants 6 and c such that if (3 < cp^n then any 
{p, (3) -jumbled graph G onn vertices has the following property. Any subgraph of G containing at most 
5p^n^ triangles may be made triangle-free by removing at most epn^ edges. 

The condition on j3 in this theorem is stronger than that employed for triangles in Theorem I5.5i As 
a result, Alon's construction does not apply and it is an open problem to determine whether the 
condition /? < cp^n is optimal or if it can be improved to /3 < cp^n. Kohayakawa, Rodl, Schacht and 
Skokan conjecture the latter, though we feel that the former is a genuine possibility. 

In a recent paper, Conlon, Fox and Zhao [27] found a way to prove a counting lemma for embedding 
any fixed small graph into a regular subgraph of a sufficiently pseudorandom host graph. Like the KLR 
conjecture for random graphs, this serves to complement the sparse regularity lemma of Kohayakawa 
|63j and Rodl in the pseudorandom context. As corollaries, they extended Theorems 15.51 and 15.61 to 
all graphs and proved sparse pseudorandom extensions of several other theorems, including Ramsey's 
theorem and the Erdos-Simonovits stability theorem. 

To state these theorems, we define the degeneracy d{H) of a graph H to be the smallest nonnegative 
integer d for which there exists an ordering of the vertices of H such that each vertex has at most d 
neighbors which appear earlier in the ordering. Equivalently, it may be defined as d{H) = max{5(i/') : 
H' C where 5{H) is the minimum degree of 

The pseudorandom analogue of the graph removal lemma proved in [27] is now as follows!^ 

Theorem 5.7 For any graph H and any e > 0, there exist positive constants 5 and c such that if 
13 < cp<^^+in then any {p, /3)-jumbled graph G on n vertices has the following property. Any subgraph 
of G containing at most Sp^^^^n^^^^ copies of H may be made H-free by removing at most epn^ edges. 

It is not hard to show, by using the random graph, that there are {p, /3)-jumbled graphs with P = 
0(^p(^(^)+'^y^n) which contain no copies of H. We therefore see that the exponent of p is sharp up to a 
multiplicative constant. However, in many cases, we expect it to be sharp up to an additive constant. 

For certain classes of graph. Theorem 15.71 can be improved. For example, if we know that the degen- 
eracy of the graph is the same as the maximum degree, such as what happens for the complete graph 
Kt, it is sufficient that (3 < cp'^^^^'^^n. In particular, for K^., we reprove Theorem 15.61 For cycles, the 
improvement is even more pronounced, since /3 < cp^^n, where ta = 3, t4 = 2, = 1 + if ^ > 5 is 
odd and = 1 + if £ > 6 is even, is sufficient for removing the cycle Ci. 

By following the proof of Krai', Serra and Vena [TOj, these bounds on the cycle removal lemma in 
pseudorandom graph£3 allow us to prove an analogue of Theorem 14.21 for pseudorandom subsets of 
any group G. The Cayley graph G{S) of a subset 5 of a group G has vertex set G and (x, y) is an edge 
of G if x~^y G S. We say that a subset S of a group G is (p, /3) -jumbled if the Cayley graph G{S) is 

'^^In [27], a slightly diflterent parameter, the 2-degeneracy d2{H), is used. Though there are many cases in which this 
parameter is more appropriate, the degeneracy will be sufficient for the purposes of our discussion here. 

^®For other properties, such as that of being (H, r)-Ramsey or that of being {H, e)-Turan, an exactly analogous theorem 
holds with the same condition /? < cp'^^^^^'^n. Any of the improvements subsequently discussed for specific graphs H 
also apply for these properties. 

^^Rather, a colored or directed version of this theorem. 
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(p, /3)-jumbled. When G is abelian, if |X]xeS'^(^)| — 1^ nontrivial characters x'- G ^ C, then S 

is (|^, /3)-jumbled (see [681 Lemma 16]). 

Theorem 5.8 For any natural number k > 3 and any e > 0, there exist positive constants 5 and c 
such that the following holds. Suppose Bi, . . . , are subsets of a group G of order n such that each 
Bi is {p, f3)-jumhled with (3 < cp^''n. If subsets Ai C Bi for i = 1, . . . , k are such that there are at most 
S\Bi\ ■ ■ ■ \Bk\/n solutions to the equation xiX2 ■ ■ ■ x^ = 1 with Xi £ Ai for all i, then it is possible to 
remove at most e\Bi\ elements from each set Ai so as to obtain sets A'- for which there are no solutions 
to X1X2 • • • Xfc = 1 with Xi £ A'- for all i. 

This result easily implies a Roth-type theorem in quite sparse pseudorandom subsets of a group. We say 
that a subset B of a group G is (e, k)-Roth if, for all integers ai, . . . , which satisfy ai + - • ■+ak = and 
gcd(aj, |G|) = 1 for 1 < i < A;, every subset A <^ B which has no nontrivial solution to x"^X2^ • • • x^'' = 1 
has < e\B\. 

Corollary 5.1 For any natural number k > 3 and any e > 0, there exists c > such that the following 
holds. If G is a group of order n and B is a {p, f3)- jumbled subset of G with f3 < cp^^n, then B is 
{e,k)-Roth. 

Note that Roth's theorem on 3-term arithmetic progressions in dense sets of integers follows from the 
special case of this result with B = G = Z„, k = 3 and oi = 02 = 1, 03 = —2. The rather weak 
pseudorandomness condition in Corollary 15.11 shows that even quite sparse pseudorandom subsets of 
a group have the Roth property. 



6 Further topics 

6.1 The Erdos-Rothschild problem 

A problem of Erdos and Rothschild ^30j asks one to estimate the maximum number h{n, c) such that 
every n-vertex graph with at least cn^ edges, each of which is contained in at least one triangle, must 
contain an edge that is in at least h{n, c) edges. Here, and throughout this subsection, we assume 
c > is a fixed absolute constant. The fact that h{n,c) tends to infinity already follows from the 
triangle removal lemma 1^ 

To see this, suppose that G is an n-vertex graph with cn^ edges such that every edge is in at least one 
and at most h := h{n,c) triangles. The total number of triangles in G is at most /icn^/3. Therefore, 
if h does not tend to infinity, the triangle removal lemma tells us that there is a collection E of o(n^) 
edges such that every triangle contains at least one of them. Since each edge in G is in at least one 
triangle, we know that there are at least triangles. It follows that some edge in E is contained 

in at least a;(l) edges. 

Using Fox's bound [38] for the triangle removal lemma, this implies that h{n,c) > e"^°^ where 
log* n is the iterated logarithm. This is defined by log* x = if x < 1 and log* x = log* (log x) -|- 1 
otherwise. This improves on the bound h{n, c) > (log* n)" which follows from Ruzsa and Szemeredi's 
original proof of the triangle removal lemma. 

^^Even the statement that h{n, c) > 1 is aheady enough to imply Roth's theorem. 
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On the other hand, Alon and Trotter (see [3T]) showed that for any positive c < j there is c' > such 
that h{n,c) < d^/n. The condition c < | is easily seen to be best possible since any n-vertex graph 
with more than edges contains an edge in at least n/6 triangles [291 [62]. Erdos conjectured that 
perhaps this behaviour is correct. That is, that for any positive c < \ there exists e > such that 
/i(n, c) > rf for all sufficiently large n. This was recently disproved by Fox and Loh [39] as follows. 

Theorem 6.1 For n sufficiently large, there is an n-vertex graph with ^(1 — 6"^'°^"-'^'''') edges such 
that every edge is in a triangle and no edge is in more than nWloglogn triangles. 

To give some idea of the construction, consider a tripartite graph between sets A, B and C, each of 
which is a copy of a lattice cube with appropriate sidelength r and dimension d. We join points in 
A and B if their distance is close to the expected distance between random points in A and B. By 
concentration, this implies that the density of edges between A and B is close to 1. We join points in 
C to points in A or B if their distance is close to half the expected distance. It is not hard to see that 
every edge between A and B is then contained in few triangles. At the same time, every edge will be 
in at least one triangle, as can be seen by considering the midpoint of any two connected points a and 

2 

h. This yields a construction with roughly ^ edges but the result of Fox and Loh may be obtained 
by shrinking the vertex set C (or blowing up A and B) in an appropriate fashion. 

6.2 Induced matchings 

Call a graph G = iV^E) an {r,t)-Ruzsa-Szemeredi graph ((r, t)-RS graph for short) if its edge set can 
be partitioned into t induced matchings in G, each of size r. The total number of edges of such a 
graph is rt. The most interesting problem concerns the existence of such graphs when r and t are both 
relatively large as a function of the number of vertices. The construction of Ruzsa and Szemeredi [93] 
using Behrend's construction demonstrates that such a graph on n vertices exists with r = e~'^^^°^"'n 
and t = n/3. The Ruzsa-Szemeredi result on the (6, 3)-problem is equivalent to showing that no 
(r, f)-RS graph on n vertices exists with r and t linear in n. 

For r linear in the number n of vertices, it is still an open problem if there exists an (r, t)-RS graph 
with t = rf. The best known construction in this case, due to Fischer et al. [37] , is an example with 
r = n/3 and t = However, for r = substantial progress was made recently by Alon, 

Moitra and Sudakov [8j by extending ideas used in the construction of Fox and Loh [39] discussed 
in the previous subsection. They give a construction of n-vertex graphs with rt = {1 — o{l)(^ and 
r = n}^''^'^\ That is, there are nearly complete graphs, with edge density 1 — o(l), such that its 
edge set can be partitioned into large induced matchings, each of order n^~"^^\ They give several 
applications of this construction to combinatorics, complexity theory and information theory. 

6.3 Testing small graphs 

A property of graphs is a family of graphs closed under isomorphism. A graph G on n vertices is e-far 
from satisfying a property P if no graph which can be constructed from G by adding and/or removing 
at most en^ edges satisfies P. An e-tester for P is a randomized algorithm which, given the quantity 
n and the ability to make queries whether a desired pair of vertices spans an edge in G, distinguishes 
with probability at least 2/3 between the case that G satisfies P and the case that G is e-far from 
satisfying P. Such an e-tester is a one-sided e-tester if when G satisfies P the e-tester determines that 
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this is the case. The property P is called testable if, for every fixed e > 0, there exists a one-sided 
e-tester for P whose total number of queries is bounded only by a function of e which is independent 
of the size of the input graph. This means that the running time of the algorithm is also bounded by a 
function of e only and is independent of the input size. We measure query-complexity by the number 
of vertices sampled, assuming we always examine all edges spanned by them. The infinite removal 
lemma, Theorem 13. 2| of Alon and Shapira |12| shows that every hereditary graph property, that is, a 
graph property closed under taking induced subgraphs, is testable. Many of the best studied graph 
properties are hereditary. 

If the query complexity of an e-tester is polynomial in e~^, we say that the property is easily testable. 
It is an interesting open problem to characterize the easily testable hereditary properties. Alon [3] 
considered the case where P = Ph is the property that the graph does not contain H as a subgraph. 
He showed that Ph is easily testable if and only if H is bipartite. Alon and Shapira |10] considered 
the case where P = P^ is the property that the graph does not contain H as an induced subgraph. 
They showed that for any graph H except for the path with at most four vertices, the cycle of length 
four and their complements, the property is not easily testable. The problem of determining 
whether the property P^ is easily testable for the path with four vertices or the cycle of length four 
(or equivalently its complement) was left open. The case where H is a path with four vertices was 
recently shown to be easily testable by Alon and Fox [7j. The case where H is a cycle of length four 
is still open. Alon and Fox also showed that if P is the family of perfect graphs, then P is not easily 
testable and, in a certain sense, testing for P is at least as hard as testing triangle- freeness. 

6.4 Local repairability 

The standard proof of the regularity lemma contains a procedure for turning a graph which is almost 
triangle-free into a graph which is triangle-free. We simply delete the edges between all vertex sets 
of low density and between all vertex sets which do not form a regular pair. This procedure can be 
made more explicit still by using an algorithmic version of the regularity lemma |4] . 

A surprising observation of Austin and Tao |13] is that this repair procedure can be determined in 
a local fashion. They show that for any graph H and any e > there exists 6 > and a natural 
number m such that if G is a graph containing at most Sn"^^^ copies of H then there exists a set A 
of size at most m such that G may be made H-iiee by removing at most en? edges and the decision 
of whether to delete a given edge uv may be determined solely by considering the restriction of G to 
the set {n,7;}El 

The key point, first observed by Ishigami [59] . is that the regular partition can be determined in a 
local fashion by randomly selecting vertex neighborhoods to create the partition. Since a finite set 
of points determine the partition, this may in turn be used to create a local modification rule which 
results in an H-iree graph. 

Similar ideas may also be applied to show that any hereditary graph property, including the property 
of being induced //-free, is locally repairable in the same sense. This again follows from the observation 
that random neighborhoods can be used to construct the partitions arising in the strong regularity 
lemma. 

Surprisingly, Austin and Tao show that, even though all hereditary hypergraph properties are testable, 

^^Strictly speaking, Austin and Tao 13 consider two forms of local repairability. Here we are considering only the 
weak version. 



28 



there are hereditary properties which are not locally repairable. On the other hand, they show that 
many natural hypergraph properties, including the property of being Ti-free, are locally repairable. 

6.5 Linear hypergraphs 

A linear hypergraph is a hypergraph where any pair of edges overlap in at most one vertex. For this 
special class of hypergraphs, it is not necessary to apply the full strength of hypergraph regularity to 
prove a corresponding removal lemma [66]. Instead, a straightforward analogue of the usual regularity 
lemma is sufficient. This results in bounds for in the linear hypergraph removal lemma which are 
of tower- type in a power of e~^. 

While this is already a substantial improvement on general hypergraphs, where the best known bounds 
are Ackermannianr'^l it can be improved further by using the ideas of [38]. This results in a bound of 
the form r(a-H loge~^). 

A similar reduction does not exist for induced removal of linear hypergraphs. Because we need to 
consider all edges, whether present or not, between the vertices of the hypergraph, we must apply the 
full strength of the strong hypergraph regularity lemma. This results in Ackermannian bounds. 

It is plausible that an extension of the methods of Section 12.21 could be used to give a primitive 
recursive, or even tower-type, bound for hypergraph removal. We believe that such an improvement 
would be of great interest, not least because it would give the first primitive recursive bound for the 
multidimensional extension of Szemeredi's theorem. Such an improvement would also be likely to lead 
to an analogous improvement of the bounds for induced hypergraph removal. 

Acknowledgements. The authors would like to thank Noga Alon, Zoltan Fiiredi, Vojta Rodl and 
Terry Tao for helpful comments regarding the history of the removal lemma. 
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